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Abstract — In network quality of service provisioning, premium 
services generally require to lieep a very small loss probability, 
which is infeasible to measure directly. The proposed virtual 
queue scheme estimates the small packet loss probability of a 
real queueing system by measuring queue statistics in a set 
of separate virtual queues. A novel scaling property between 
the real queue and the virtual queues is deduced on the basis 
of the maximum variance asymptotic (MVA) theory. The new 
scheme retains the high accuracy and wide applicability of 
the MVA method for aggregated traffic while avoiding the 
high computational complexity in a direct application of the 
original MVA analysis in real time. This makes it suitable for 
online measurement applications such as network performance 
monitoring and measurement-based admission control. 

I. Introduction 

In the framework of quality-of-service (QoS) provision- 
ing in the Internet, online QoS estimation (or monitoring) 
plays a significant role in dynamic resource allocation and 
measurement-based admission control. As one of the most 
important QoS parameters, the packet loss probability (or 
buffer overflow probability) should have very small values 
in practical network operations to provide satisfying services, 
e.g., in the magnitude of 10^® ^ 10"^. However, a direct 
online measurement of such a small loss probability is difficult 
due to the large variability of the measurement values. 

QoS estimator based on virtual queue (VQ) R], H treats 
loss probability as a function of the buffer size when the 
input traffic remains the same. A specific type of function 
is presumed for the loss probability. Then, separate virtual 
queues with reduced buffer size are derived from the real 
queue. A copy of the real input traffic is injected to each 
of the virtual queues. As a result, a virtual queue sees a 
relatively large overflow probability that is feasible to be 
measured. On the basis of the measured loss probabilities in 
virtual queues, the loss probability of the real system can be 
calculated by interpolation according to the presumed shape 
of the loss probability function. VQs are implemented by 
means of counters/timers, meaning low realization overhead. 
However, the queue responses in the Internet are complex and 
dynamic. This makes the assumption of knowing a priori the 
loss probability pattern rigid or infeasible. As a consequence, 
the applicability of such VQ schemes is much constrained. 

A more general estimation approach is to measure the 
statistics of input traffic, from which the packet loss probability 



is further computed by queueing analysis. Generally, the more 
accurate the estimation, the more traffic statistics are necessary 
and the higher complexity the computation. Maximum vari- 
ance asymptotic (MVA) (3), f?! analysis provides queueing 
performance estimation when the input traffic can be modeled 
by a Gaussian process. This is typical in high-speed links 
where the traffic is aggregated from a large number of flows. 
The high accuracy of the MVA analysis was verified [SJ- 
|5| for a wide spectrum of traffic types including the long- 
range-dependent (LRD) traffic 1*61. This admirable feature is 
attributed to the generality of the Gaussian process and the 
central limit theory. However, the MVA analysis requires to 
measure the variance of traffic arrivals on different time scales. 
Among these, the most relevant time scale needs to be iden- 
tified by search operation with a complexity of 0(n), where 
n is the number of discrete time scales. The implementation 
complexity for the measurement and computation is high in 
real-time application environments. 

In this paper, we propose a VQ estimation scheme. While 
the proposed scheme also has a low implementation complex- 
ity as general VQ approaches |[l], it makes no assumption 
on knowing a priori the changing pattern of the loss probability 
with buffer size. Instead, a novel scaling property is adopted, 
which is derived on the basis of the MVA analysis to map 
between the VQs and the real queue. Particularly, the config- 
uration of VQs and the mapping between loss probabilities in 
real/virtual queues are both guided by the scaling property. The 
scheme thus inherits the high accuracy and wide applicability 
of the MVA method for aggregated traffic while avoiding the 
complexity in a direct application of the original MVA analysis 
in real time. Heuristic analysis shows that the proposed scheme 
achieves a significant variance-reduction-effect in estimating 
very small loss probability. 

In the rest of this paper. Section II reviews the original MVA 
analysis of buffer overflow probability. The new VQ estimator 
is introduced and analyzed in Section III. Section IV evaluates 
the performance of the VQ estimator by simulation. The paper 
is concluded in Section V. 

II. Maximum Variance Asymptotic Analysis 

Consider a single server system with a first-in-first-out 
(FIFO) buffer of infinite size. The service rate is c. Let Q 
denote the queue length. P{Q > q} is the tail probability of 



the queue length. Here, the parameter q will be referred to as 
the threshold. 

Let A{t) denote the amount of traffic arrival within an 
arbitrary time interval of length t. The mean traffic rate is 
r. The time-scale-dependent variance of the traffic is further 
defined as v{t) — VAR[^(t)]. In case the traffic is aggregated 
from a large number of sources, according to the central 
limit theorem the distribution of A{t) can be approximated 
by a Gaussian distribution. In light of this, the tail probability 
P{Q > q} is dominantly influenced by the variance v{t) on 
a specific time scale r called the dominant time scale (DTS) 
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In other words, r is the value of t that minimizes the function 
^(g, c, t). It can be further obtained by the MVA analysis f3l: 
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While Eq. (|2|l provides a tight approximation of the tail 
probability P{Q > q} for infinite buffer size, in practice 
the loss probability (or buffer overflow probability) is mostly 
concerned because the real buffer size is always bounded to 
limit the realization cost or to restrict the maximal queueing 
delay. Let Phix) denote the loss probability in a buffer of 
limited size x. In P], it is proposed to calculate Pl{x) by: 
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Here, P{Q > x} and P{Q > 0} are interpreted in terms of 
tail probability measures in the circumstance that the system 
would have infinite buffer size. They are further calculated 
by Eq. Pl(0) is the loss probability if the system were 
bufferless. Pl{0)/ P{Q > 0} turns out to be a scalar. Its 
approximate solution is given in |4|, which relies on an integral 
operation. 

Despite the advantages in the accuracy and generality, 
the MVA analysis with Eq. ([T][3]l involves measurements of 
the variance process v{t) over an extensive range of time 
scale t [7\ as well as the search for DTS to calculate the 
infimum. In practice, v{t) can only be measured on discrete 
time scales A, 2 A, nA where A is the smallest time scale 
that is relevant to the queueing performance. Typically, A 
is in the magnitude of single packet service time, which is 
very small on high speed links. This results in a large n. 
The computational complexity in updating the statistics v{t) 
amounts to 0{n). So does the complexity in search of the 
DTS. Consider the transmission of video transport streams 
with standard packet size of 188 bytes over a 100 Mbps 
link, n amounts to 6649 in order to cover time scales up to 
100 ms. This is very demanding in the implementation for 
online estimation. 



III. Virtual Queue Estimator 

The proposed VQ estimator aims to determine the very 
small loss probability Pl{x) for a single server system with 
a FIFO buffer of size x. This is called the real system in the 
following sections. MVA analysis is applied to derive a novel 
scaling property. By taking advantage of the scaling property, 
the loss probability can be computed from three probability 
statistics respectively measured in three VQs. Consequently, 
the measurement of the variance process v{t) and the search 
for the DTS are not necessary any more. 

In this section, the basic concept of the VQ estimator is 
firstly addressed. The scaling property is derived by MVA 
analysis. Then, a heuristic analysis is performed to show the 
variance reduction achieved by the VQ estimator. On this 
basis, a guideline for the parameter configuration is outlined. 

A. VQ Concept and Scaling Property 

To apply Eq. (O in the VQ paradigm, a total of three 
VQs are constructed, denoted by VQl, VQ2 and VQS. They 
are configured to obtain the measure ^1(0), P{Q > 0} and 
P{Q > x}, respectively. The input traffic in each VQ is a 
copy of the real input traffic. Among the three VQs, VQl and 
VQ2 have the same service rate c as the real system. However, 
VQl is bufferless so that the loss probability Pl{0) can be 
easily measured. VQ2 has an infinite buffer size in order to 
give the measurement of P{Q > 0}. Note that Pl{0) and 
P{Q > 0} have relatively large values. So, the correspondent 
measurements in the VQl and VQ2 are straightforward. As a 
result, the scalar Pi^{0) / P{Q > 0} in Eq. (O is obtained from 
VQ-measurements. The complex numerical calculation in 
is omitted. 

VQ3 has an unlimited FIFO buffer for the estimation of the 
tail probability P{Q > x}, which is too small to be measured 
directly. The idea is to scale down the threshold x and service 
rate c to build the VQ3. The resulting tail probability in VQ3 
is thus raised to a level that can be practically measured. 
In the following, we use x', c' and P{Q' > x'} to denote 
the concerned threshold, service rate and tail probability in 
VQ3, respectively. The key to the design lies in mapping the 
measured tail probability P{Q' > x'} in VQ3 to the tail 
probability P{Q > x} according to a scaling property. 

To avail a scaling between P{Q > x} and P{Q' > x'} 
without the traffic measurement for v{t) and the calculation 
of DTS, it is necessary to keep an unchanged DTS in VQ3 
after the scale-down. With reference to Eq. ([T]i, we found that 
this can be assured if 



g(x',c',t) = k g{x,c,t), 
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where fc is a scalar independent of t. This condition is fully 
satisfied if we set: 
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where a : a > 1 is the constant scaling factor and r is 
the mean traffic rate. It leads to g{x',c',t) = g{x,c,t)/a. 



Inserting this into Eq. (O, we obtain the scaHng property 
between P{Q > x} and P{Q' > x'} 
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Eq. (O reveals how the tail probability P{Q > x} is 
mapped from the measurement of P{Q' > x'} in VQ3. 
Together with the measurements Ph{0) and P{Q > 0} from 
VQ1/VQ2, the loss probability Pl{x) of the real system is 
calculated according to Eq. (O. The overall computational 
complexity is as low as 0(1) in contrast to that of the MVA 
analysis. 

B. Evaluation of Variance Reduction 

Because Pl(0) and P{Q > 0} in Eq. Q can be measured 
in VQ1/VQ2 with relatively high credibility as described in 
Section III. A, the variance in the estimation of Pl{x) is mainly 
attributed to the estimation of the very small tail probability 
P{Q > x}. In the following analysis, we focus on the effect 
of P{Q > x}, neglecting that of P^O) and P{Q > 0}. The 
variance reduction of the VQ estimator is thus characterized 
by the ratio 77: 

^COV[pVQ{Q > a;}] 2 
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Here COV[-] denotes the coefficient of variation (COV). 
stands for the direct measurement of P{Q > x}. P^^{Q > 
x} denotes the estimation of P{Q > x} by the VQ approach. 
Because i;[P^Q{Q > x}] w E[p^] = P{Q > x}, Eq. © is 
simplified to 



V = 



VAR[P^Q{g > x}] 



(10) 



VAR[p,] 

where VAR[-] denotes the variance. 

Let Ux' denote the measurement of P{Q' > x'} in VQ3. 
According to Eq. (Ell, P^^{Q > x} ^ (u^')"'- Through 
linearization |[8l, the variance of P^^jQ > x} is: 
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Formally, the tail probability P{Q > x} is defined as the 
average ratio between the sum of the time duration in which 
Q > X holds and the total observation duration. For a heuristic 
analysis, however, we equalize this tail probability to the 
probability that an arbitrary packet sees Q > x upon its arrival 
[J. Correspondingly, we employ = M/N where M is the 

'The quantitative difference between this "packet-seen" probability and the 
"time-average" probability is generally small. 



number of packets seeing Q > x and is a constant denoting 
the total number of packet arrivals. So, E[px] — P{Q > x}. 
Assume that the probability that a packet sees Q > x is 
independent. M then follows a binomial distribution. So, the 
variance of p^ turns out to be: 

YAR[px]=P{Q>x}{l-P{Q>x})/N. (12) 

Similarly, the variance of iix' in VQ3 is: 

VAR[m:,,] = P{Q' > x'}{l - P{Q' > x'})/N. (13) 

Applying Eq. ^ and inserting Eq. (fTl1l-(fT3]l to Eq. ( fTOb . 
we obtain: 
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In Fig. [U 77 is plotted with respect to the scaUng factor a 
according to Eq. (Ell. P{Q > x} is set to 10"'', 10"*^, 10-^ 
respectively. At the beginning, 77 decreases very fast with the 
increase of a because a large scaling factor results in a high 
level of P{Q' > x'} which allows for a stable measurement 
iix'- On the other hand, the mapping operation Eq. ([8]) can 
also amplify the variance of iix'- Therefore, 77 increases for 
too large values of a. However, such increase is very slow 
with the change of a, meaning that 77 is insensitive to over- 
scaling. Actually for all three curves 77 becomes quite stable 
as long as a > 2.0. 

With Eq. (O, the minimal rj can be numerically calculated 
at different magnitudes of P{Q > x}, which is plotted in 
Fig. |2] marked with legend ?7n,in. It shows that the smaller the 
tail probability is, the more variance reduction can be achieved. 
This verifies the potential of the VQ scheme for the estimation 
of rare events. 

As a reference, we also look at a ratio (p = 
(COV[^]/COV[pj;])^. Here, | is the direct measurement of 
a reference tail probability that has a relatively large value 
of 0.01. In Fig. 121 the curve of locates quite close to 
the curve of riini„. It implies that the stability of the VQ 
estimation is comparable to the direct measurement of a 
probability in the magnitude of 0.01, which is quite practical. 
This promises a good applicability of the VQ scheme in real- 
time environments. 

C. Configuration of the Scaling Factor 



With a sa ^J\ogP{Q > x}/ log P{Q' > x'} obtained from 
Eq. dUl, Eq. (O is rewritten as: 
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Note that on the right hand side, the first term depends solely 
on P{Q' > x'} and the second term depends solely on 
P{Q > x}. Therefore, the minimization of 77 with respect 
to P{Q' > x'} needs only to consider the first term. By 
numerical computation, it is found that P{Q' > x'} ~ 0.2032 
in order to achieve a minimal 77. On this basis, a can be 
determined correspondingly using Eq. ^ for a specific range 
of the tail probability P{Q > x}. 




Fig. 1. Influence of ct on the variance reduction. Fig. 2. Tlie minimal r\ at different levels of Fig. 3. r) with fixed values of a. 

P{Q > x}. 



In practice, the real P{Q > a;} is a dynamic parameter. A 
fixed setting of a should provide a stable estimation over a 
wide range of the P{Q > x} values. In Fig. [3] we use Eq. (fT4] | 
and show the deviation of the ratio 77 from the minimal rj when 
a is fixed to 1.5, 2.5 and 3.5, respectively. It is seen that a 
large performance degradation only occurs in case of too small 
a — 1.5. With sufficiently large a = 2.5, 3.5, rj is quite stable 
and very close to the optimum. This is in consistency with the 
observations in Fig. [T] 

IV. Performance Evaluation 
A. Simulation Configuration 

The performance of the proposed VQ estimator is evaluated 
by simulation for a single-server FIFO queue system with 
bounded buffer size. The service rate is c = 100 Mbps. The 
VQ estimator uses three VQs, in which the statistics -Pl(O), 
P{Q > 0} and P{Q' > x'} are respectively measured for a 
common time duration. The duration is called measurement 
window. Naturally, larger measurement window results in 
more reliable measurement results. In VQ3, the buffer size 
x' is configured by Eq. (|5]l with scaling factor a — 2.5, 3.0 
respectively. The mean traffic rate r is sampled in terms of the 
average rate over the elapsed time in the current measurement 
window. VQ3 uses samples of r to periodically update its 
service rate c' according to Eq. (|6]l at constant interval of 
1 ms. The real loss probability is estimated at the end of 
the measurement window by Eq. (|3]l and dHJ. A measurement 
window always starts when the queue system is in the steady 
state after a long transient simulation phase (5000 s). 

Two types of input traffic are inspected. Type A traffic aims 
to simulate the traffic of audio services. It is the aggregation of 
homogeneous on-off sources. The on-period follows a negative 
exponential distribution with mean value 0.2 s. The off-period 
is negative-exponentially distributed with mean value 0.6 s. In 
the on-period, each source generates a bit stream at constant 
rate of 64 kbps. The bit stream is packetized into packets of 
constant length 500 bytes. 

Type B traffic is also a multiplexing of independent flows. 
Each flow is generated by a 3-states markov modulated 



Poisson process (MMPP) to simulate the traffic of video 
services. We adopt the setting in |9|: the state-dependent rate 
Ai — 83/s, A2 = 367/s, A3 = 661/.s; the state transition rate 
qi2 = 40,qi3 = 1,921 = 10,923 = 1-3, gai = 6,532 = 0.1. 
The packet length is constant 188 bytes. This setting was 
derived from realistic MPEG-4 video traces with an average 
bitrate of 540 kbps lH]. In either case (Type A/B), the system 
load p is configured by changing the number of traffic sources. 

B. Estimation Accuracy 

The estimation performance is first evaluated by setting a 
very large measurement window (25000 s) so that all the mea- 
surement results are stable. This aims to assess the estimation 
accuracy in steady states. The loss probability estimations by 
the VQ estimator are plotted in Fig.|4]and|5]for Type A (at load 
p = 0.8) and Type B traffic (at load p = 0.81), respectively. 
For comparison, the real loss probabilities measured by the 
long simulations are drawn with solid lines. 

In the figures, the VQ estimations show a good follow-up 
with the real loss probabilities. The estimation deviation is well 
constrained within one order of magnitude, which corresponds 
to the accuracy of the MVA method |4 |. This stands for a quite 
high accuracy among the existing estimation methods that are 
able to deal with generalized input traffic. 

C. Transient Response 

To illustrate the transient response of the VQ estimator, the 
loss probability under specific buffer size is estimated with the 
measurement window size constantly incremented at a step of 
20 ms. The results are plotted for Type A traffic in Fig. |6] and 
for Type B traffic in Fig. [T] The traffic setting is the same to 
that in Section IV.B. 

In both graphs, the VQ estimator gives a quick response 
with probable overshoot. The estimations fluctuate for a short 
time and converge soon. Stable estimations are obtainable with 
a measurement window as small as 3 s. This verifies the 
effectiveness of the VQ estimator for online loss probability 
estimation. The same evaluation is also performed for a direct 
measurement of the real loss probability. For the system 
scenario corresponding to Fig. |6] the direct measurement does 
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Fig. 4. Estimation accuracy for synthetic audio flows. 
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Fig. 6. Transient response for synthetic audio flows. 

not report any packet loss until 13 s. For the scenario of Fig.|7] 
the direct measurement records the first packet loss after 22 s. 
(The results are not shown in the graphs so as to focus on the 
response time-scale of the VQ estimator). The performance 
advantage of the VQ scheme is substantial. 

V. Conclusion and Outlook 

The proposed VQ scheme does not rely on any presumption 
of the loss probability pattern with respect to buffer size. 
Instead, a novel scaling property is derived based on the MVA 
analysis to map between the real queue and the VQs. The new 
scheme inherits the advantages of conventional VQ approaches 
and the MVA method: low implementation overhead, high 
accuracy and a general applicability for aggregated Internet 
traffic. These advantages make it a promising online estimator. 

Our work also opens a new way to connect the rich 
research field of queueing analysis to the real-time network 
performance estimation. We are looking forward to exploring 
further scaling properties to avail the development of advanced 
VQ solutions. 
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Fig. 5. Estimation accuracy for synthetic video flows. 
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Fig. 7. Transient response for synthetic video flows. 
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